Prevent collapsing batch dims in dot ops with constants #2823

shivadbhavsar · 2024-02-23T01:05:21Z

This simplifies many reshape -> dot -> reshape patterns that are not handled by the find_reshape_reshape_dot pass (ie. in gemms where one input is a constant).

This also simplifies the reshape found in #2736

codecov · 2024-02-23T21:59:10Z

Codecov Report

Attention: Patch coverage is 97.72727% with 1 line in your changes missing coverage. Please review.

Project coverage is 91.93%. Comparing base (30cab64) to head (92b2246).
Report is 150 commits behind head on develop.

Files with missing lines	Patch %	Lines
src/simplify_reshapes.cpp	97.72%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #2823   +/-   ##
========================================
  Coverage    91.92%   91.93%           
========================================
  Files          489      489           
  Lines        19275    19301   +26     
========================================
+ Hits         17719    17744   +25     
- Misses        1556     1557    +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

shivadbhavsar · 2024-02-23T22:32:17Z

SDXL Pref results for reference:

Torch-MIGraphX (end to end):
Before PR: 2850 ms
With PR: 2801 ms

ONNX Unet (4x attn trim):
Before PR: 5.54 ms
After PR: 5.52 ms

As expected, it doesnt affect the onnx version much because there is an extra convert in the middle. Once the convert is handled, the perf number reduces to 5.47ms.

pfultz2 · 2024-02-23T22:53:11Z

src/simplify_reshapes.cpp

+
+        auto sq_const =
+            m.insert_instruction(mbr, make_op("squeeze", {{"axes", sq_axes}}), constant);
+        m.replace_instruction(mbr, mbr->get_operator(), sq_const);


Couldn't we replace it with broadcast instead?

This is just removing any unnecessary preceding dims in literals eg. {1, 1, 640, 640) which are later broadcasted to something like {2, 32, 640, 640}. Would broadcast work for this? I thought it only does 1 axis

src/simplify_reshapes.cpp

migraphx-bot · 2024-02-26T21:12:36Z

Test	Batch	Rate new 048bcd	Rate old dc028d	Diff	Compare
torchvision-resnet50	64	1,703.96	1,489.70	14.38%	🔆
torchvision-resnet50_fp16	64	3,796.06	1,346.10	182.00%	🔆
torchvision-densenet121	32	1,445.65	1,440.50	0.36%	✅
torchvision-densenet121_fp16	32	2,424.59	2,416.72	0.33%	✅
torchvision-inceptionv3	32	878.68	881.27	-0.29%	✅
torchvision-inceptionv3_fp16	32	1,408.15	1,406.66	0.11%	✅
cadene-inceptionv4	16	406.42	404.25	0.54%	✅
cadene-resnext64x4	16	411.54	410.23	0.32%	✅
slim-mobilenet	64	3,805.08	3,794.28	0.28%	✅
slim-nasnetalarge	64	96.56	94.95	1.69%	✅
slim-resnet50v2	64	1,643.38	1,620.87	1.39%	✅
bert-mrpc-onnx	8	586.19	591.10	-0.83%	✅
bert-mrpc-tf	1	288.53	289.30	-0.27%	✅
pytorch-examples-wlang-gru	1	336.52	378.53	-11.10%	🔴
pytorch-examples-wlang-lstm	1	303.46	266.28	13.96%	🔆
torchvision-resnet50_1	1	440.60	369.37	19.28%	🔆
cadene-dpn92_1	1	244.39	233.66	4.59%	🔆
cadene-resnext101_1	1	187.18	189.16	-1.04%	✅
onnx-taau-downsample	1	203.30	183.13	11.02%	🔆
dlrm-criteoterabyte	1	22.19	21.99	0.92%	✅
dlrm-criteoterabyte_fp16	1	41.47	41.43	0.10%	✅
agentmodel	1	6,060.17	6,337.70	-4.38%	🔴
unet_fp16	2	33.34	33.63	-0.85%	✅
resnet50v1_fp16	1	566.17	521.53	8.56%	🔆
resnet50v1_int8	1	462.93	452.53	2.30%	✅
bert_base_cased_fp16	64	617.49	620.67	-0.51%	✅
bert_large_uncased_fp16	32	192.68	193.85	-0.61%	✅
bert_large_fp16	1	103.66	103.88	-0.21%	✅
distilgpt2_fp16	16	1,150.51	1,187.83	-3.14%	🔴
yolov5s	1	297.67	297.39	0.09%	✅
tinyllama	1	23.21	23.34	-0.53%	✅
vicuna-fastchat	1	133.22	132.19	0.78%	✅
whisper-tiny-encoder	1	240.05	240.52	-0.19%	✅
whisper-tiny-decoder	1	244.55	245.42	-0.35%	✅

This build is not recommended to merge 🔴

migraphx-bot · 2024-02-26T21:12:37Z

✅ bert-mrpc-onnx: PASSED: MIGraphX meets tolerance

✅ bert-mrpc-tf: PASSED: MIGraphX meets tolerance

✅ pytorch-examples-wlang-gru: PASSED: MIGraphX meets tolerance

✅ pytorch-examples-wlang-lstm: PASSED: MIGraphX meets tolerance

✅ torchvision-resnet50_1: PASSED: MIGraphX meets tolerance

✅ cadene-dpn92_1: PASSED: MIGraphX meets tolerance

✅ cadene-resnext101_1: PASSED: MIGraphX meets tolerance

✅ dlrm-criteoterabyte: PASSED: MIGraphX meets tolerance

✅ agentmodel: PASSED: MIGraphX meets tolerance

✅ unet: PASSED: MIGraphX meets tolerance

✅ resnet50v1: PASSED: MIGraphX meets tolerance

✅ bert_base_cased_fp16: PASSED: MIGraphX meets tolerance

🔴bert_large_uncased_fp16: FAILED: MIGraphX is not within tolerance - check verbose output

✅ bert_large: PASSED: MIGraphX meets tolerance

✅ yolov5s: PASSED: MIGraphX meets tolerance

✅ tinyllama: PASSED: MIGraphX meets tolerance

✅ vicuna-fastchat: PASSED: MIGraphX meets tolerance

✅ whisper-tiny-encoder: PASSED: MIGraphX meets tolerance

✅ whisper-tiny-decoder: PASSED: MIGraphX meets tolerance

✅ distilgpt2_fp16: PASSED: MIGraphX meets tolerance

src/simplify_reshapes.cpp

shivadbhavsar and others added 2 commits February 22, 2024 21:02

initial const dot matcher work

f061139

rsp_const_dot matcher

cdc7977

shivadbhavsar self-assigned this Feb 23, 2024

shivadbhavsar mentioned this pull request Feb 23, 2024

mul add transpose dot matcher #2809

Merged

Merge remote-tracking branch 'origin/develop' into const_dot_matcher

8921854

shivadbhavsar marked this pull request as ready for review February 23, 2024 20:59

shivadbhavsar requested a review from causten as a code owner February 23, 2024 20:59

shivadbhavsar requested review from pfultz2 and umangyadav February 23, 2024 20:59

remove handling of convert and fix breaking test case

f147dfc

pfultz2 reviewed Feb 23, 2024

View reviewed changes

src/simplify_reshapes.cpp Outdated Show resolved Hide resolved

umangyadav reviewed Feb 23, 2024

View reviewed changes

src/simplify_reshapes.cpp Outdated Show resolved Hide resolved

umangyadav reviewed Feb 23, 2024

View reviewed changes

src/simplify_reshapes.cpp Outdated Show resolved Hide resolved

umangyadav reviewed Feb 26, 2024

View reviewed changes

src/simplify_reshapes.cpp Outdated Show resolved Hide resolved

remove const constraint in rehsape_dot matcher

cf23737

shivadbhavsar requested review from pfultz2 and umangyadav February 26, 2024 18:34

shivadbhavsar added 2 commits February 26, 2024 18:44

merge develop

15b07bb

remove used once constraint for const_multibroadcast

6819b95

umangyadav approved these changes Feb 26, 2024

View reviewed changes

shivadbhavsar added 3 commits February 26, 2024 22:02

merge develop

6e7a1c8

add matcher to move convert before reshapes

d06e4f8

change reshape_convert matcher to only apply when preceeding dot

efa81d9

shivadbhavsar added the Perf Improve label Feb 28, 2024

combine reshape-dot matchers

6dc309c

pfultz2 reviewed Mar 7, 2024

View reviewed changes

src/simplify_reshapes.cpp Outdated Show resolved Hide resolved

pfultz2 reviewed Mar 7, 2024

View reviewed changes

src/simplify_reshapes.cpp Outdated Show resolved Hide resolved

merge master and resolve conflicts

048bcda

shivadbhavsar requested a review from umangyadav May 29, 2024 19:07

causten requested a review from CharlieL7 May 29, 2024 19:14

Merge branch 'develop' into const_dot_matcher

92b2246

causten merged commit 0da3173 into develop May 31, 2024
45 of 47 checks passed

causten deleted the const_dot_matcher branch May 31, 2024 20:53

lajagapp pushed a commit to lajagapp/AMDMIGraphX that referenced this pull request Jul 8, 2024

Prevent collapsing batch dims in dot ops with constants (ROCm#2823)

d40d999

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prevent collapsing batch dims in dot ops with constants #2823

Prevent collapsing batch dims in dot ops with constants #2823

shivadbhavsar commented Feb 23, 2024

codecov bot commented Feb 23, 2024 •

edited

Loading

shivadbhavsar commented Feb 23, 2024

pfultz2 Feb 23, 2024

shivadbhavsar Feb 24, 2024

migraphx-bot commented Feb 26, 2024 •

edited

Loading

migraphx-bot commented Feb 26, 2024

Prevent collapsing batch dims in dot ops with constants #2823

Prevent collapsing batch dims in dot ops with constants #2823

Conversation

shivadbhavsar commented Feb 23, 2024

codecov bot commented Feb 23, 2024 • edited Loading

Codecov Report

shivadbhavsar commented Feb 23, 2024

pfultz2 Feb 23, 2024

Choose a reason for hiding this comment

shivadbhavsar Feb 24, 2024

Choose a reason for hiding this comment

migraphx-bot commented Feb 26, 2024 • edited Loading

migraphx-bot commented Feb 26, 2024

codecov bot commented Feb 23, 2024 •

edited

Loading

migraphx-bot commented Feb 26, 2024 •

edited

Loading